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We study the sensitivity of weak lensing surveys to the effects of catastrophic redshift errors 
— cases where the true redshift is misestimated by a significant amount. To compute the biases 
in cosmological parameters, we adopt an efficient linearized analysis where the redshift errors are 
directly related to shifts in the weak lensing convergence power spectra. We estimate the number 
Nspec of unbiased spectroscopic redshifts needed to determine the catastrophic error rate well enough 
that biases in cosmological parameters are below statistical errors of weak lensing tomography. 
While the straightforward estimate of A'apec is ~ 10^, we find that using only the photometric 
redshifts with 2 < 2.5 leads to a drastic reduction in N^pec to ~ 30, 000 while negligibly increasing 

■ statistical errors in dark energy parameters. Therefore, the size of spectroscopic survey needed to 
control catastrophic errors is similar to that previously deemed necessary to constrain the core of the 

Mh ' Zs — Zp distribution. We also study the efficacy of the recent proposal to measure redshift errors by 

, cross-correlation between the photo-z and spectroscopic samples. We find that this method requires 

~ 10% a priori knowledge of the bias and stochasticity of the outlier population, and is also easily 
confounded by lensing magnification bias. The cross-correlation method is therefore unlikely to 
supplant the need for a complete spectroscopic redshift survey of the source population. 

u 

• , I. INTRODUCTION 

I ■ Weak gravitational lensing is a very promising cosmological probe that has potential to accurately map the distri- 
O I bution of dark matter and measure the properties of dark energy and the neutrino masses (for reviews, see [1-4]). It 
■^-^ . is well understood, however, that systematic errors may stand in the way of weak lensing reaching its full potential 
^ ' — that is, achieve the statistical errors predicted for future ground and space based surveys such as the Dark Energy 
Survey (DES), Large Synoptic Survey Telescope (LSST), and the Joint Dark Energy Mission (JDEM). Controlling 
the systematic errors is a primary concern in these and other surveys so that a variety of dark energy tests (recently 
proposed and reviewed by the JDEM Figure of Merit Science Working Group [5] ) can be performed to a desired high 
accuracy. 

QQ ] Several important sources of systematic errors in weak lensing surveys have already been studied. Chief among 
f — . . them is the redshift accuracy — approximate, photometric redshifts are necessary because it is infeasible to obtain 
C^l ■ optical spectroscopic redshifts for the huge number (~ 10^-10^) of galaxies that future surveys will utilize as lensing 
, sources. It is therefore imperative to ensure that statistical errors and systematic biases in the relation between 
' photometric and spectroscopic redshifts (recently studied in depth with real data [6-10]) do not lead to appreciable 
0^ I biases in cosmological parameters. 

' The relation between the photometric and spectroscopic redshift has been previously modeled as a Gaussian with 
^ I rcdshift-dcpcndcnt bias and scatter. It is found that both the bias and scatter (that is, quantities {zp — Zg) and 

■ [{zp — Zg)^)^/^ in each bin of Az ~ 0.1), need to be controlled to about 0.003 or better in order to lead to less than 
r> ' ^ 50% degradation in cosmological parameter accuracies [11-13]. These constraints are a bit more stringent in the 

■ most general case when the redshift error cannot be described as a Gaussian [14]. These requirements imply that 
" " ' -/Vgpoc ^ 10^ spectra are required in order to calibrate the photometric redshifts to the desired accuracy. Gathering 

such relatively large spectroscopic sample will be a challenge, setting a limit to the useful depth of weak lensing 
surveys. [While the redshift errors have been well studied, other systematics are also important, especially theoretical 
errors in modeling clustering of galaxies at large and small scales, intrinsic shape alignments, and various systematic 
biases that take place during observations [15-50].] 

All of the aforementioned photo-z requirement studies (e.g. [11, 12]), however, have modeled the errors as a per- 
turbation around Zs — Zp relation. While this perturbation was allowed to be large and to have a nonzero scatter and 
even skewness (e.g. [11]), it did not subsume a general, multimodal error in redshift. 

In this paper we would like to remedy this omission by estimating the effect of catastrophic redshift errors. 
Catastrophic errors are loosely defined as cases when the photometric redshift is grossly misestimated, i.e. when 
\zp — Zs| ~ 0(1), and are represented by arbitrary "islands" in the Zp — Zg plane. We develop a formalism that treats 
these islands as small "leakages" (or "contaminations") and directly estimates their effect on bias in cosmological 
parameters. We then invert the problem by estimating how many spectroscopic redshifts are required to control 
catastrophic errors at a level that makes them harmless for cosmology. 
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The paper is organized as follows. In §11 we derive the relevant equations for the bias in cosmological parameters 
induced by misestimated catastrophic redshift errors in a tomographic weak lensing survey. In §111 we apply these 
methods to a canonical ambitious weak-lensing cosmology project. In §IV we ask: how large must a complete 
spectroscopic redshift survey be in order that the catastrophic photo-z error rates be measured sufficiently well that 
remnant cosmological biases are well below the statistical uncertainties? Newman [51] has suggested an alternative 
mode of measuring the photo-z error distribution, namely the angular cross-correlation of the photometric galaxy 
sample nominally at Zp with a spectroscopic sample at Zg; in §V we investigate whether systematic errors in the 
photo-z outlier rates derived from this cross-correlation technique will be small enough to render cosmological biases 
insignificant. The final section discusses the scaling of these results with critical survey parameters, the ramifications 
for survey design, and areas of potential future investigation. 



II. FORMALISM 



In this section we establish the formalism that takes us from "islands" in the Zs — Zp plane to biases in cosmolog- 
ical parameters. First, however, we define the basic observable quantity, the convergence power spectrum, and its 
corresponding Fisher information matrix. 



A. Basic observables and the Fisher matrix 



The convergence power spectrum of weak lensing at a fixed multipole i and for the ith and jth tomographic bin is 
given by 

where r{z) is the comoving distance, H(z) is the Hubble parameter, and 7^mat(fc, z) is the matter power spectrum. The 
weights W^ are given, for a flat Universe, by Wi(x) = | gi{x) (1 + z) where gi{x) =xj^ dxsni{xs){Xs-X)/Xs, 

X is the comoving distance and rii is the comoving density of galaxies if Xs falls in the distance range bounded by the 
ith redshift bin and zero otherwise. Wc employ the redshift distribution of galaxies of the form n{z) oc cxp(— z/zq) 
that peaks at 2zo ~ 0.9. 

The observed convergence power spectrum is 

c:,{e) = v:^{t) + 5.,^^, (2) 

where (7int)^^^ is the rms intrinsic shear in each component which we assume to be equal to 0.24, and fii is the average 
number of galaxies in the ith redshift bin per steradian. The cosmological constraints can then be computed from the 
Fisher matrix 

F,=5:^Cov-i|^, (3) 

^ dp., dpj 

where Cov~^ is the inverse of the covariance matrix between the observed power spectra. For a Gaussian convergence 
field, its elements are given by 

Gov [C« (£) , [1)] = ^^^^^^'^^^^^^ [Ctk m + Ct, {i)C^, {£)] . (4) 

where /gi^y is fraction of the sky observed and A£ is the binning of the convergence power spectra in multipole. 

Our fiducial SNAP survey described below, without any theoretical systematics, determines wq and Wa to accuracies 
of <j{wq) = 0.089 and a{wa) = 0.31 (corresponding to the pivot value determined to cr(wp) = 0.027). 



B. Biases in the Gaussian limit 



Gonsider the general problem of constraining a vector of cosmological parameters P = {pi} based on an observed 
data vector D = {Da}. If the observable quantities Da are distributed as Gaussians with covariance matrix C, then 
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the first-order formula for bias in the ith parameter, Api, induced by a bias AD in the data is (e.g. [52. 53]) 



Ap. = E JT^i^^'U ^Dp. (5) 

J a/3 "^P' 



Here F is the Fisher matrix for the cosmological parameters, and is defined as the second derivative of log likelihood 
(C = — InL) with respect to the parameters. The bias above can be more concisely expressed as 

AP ^ F-^Q AD = F-^V, (6) 

where we have defined the matrix Q and vector V as 

V = QAD. (8) 

The induced parameter bias is considered unimportant if it is small compared to the expected statistical variation 
in the cosmological parameters. In the case where the likelihood in the parameter space is Gaussian, the likelihood 
of the bias AP being exceeded by a statistical fluctuation is determined by 

Ax^ = AP^FAP ^ V^F~^V (9) 

In the Appendices we prove two useful theorems about A^^: 

1. The bias significance A^^ always decreases or stays fixed when we augment the likelihood with (unbiased) prior 
information, e.g. data from a non-lensing technique; 

2. Ay^ always decreases or stays fixed when we marginalize over one or more dimensions of the parameter space. 
In the Gaussian limit, the bias Api is unaffected by marginalization over other parameters. 

We will use these results later to argue that our requirements on the control of catastrophic redshift errors are 
conservative, in the sense that adding other cosmological data or considering individual cosmological parameter 
biases will only weaken the requirements. 



C. The case of catastrophic photo-z errors 

For weak-lensing tomography, the data elements Da are the convergence (or shear) cross-power spectrum elements 
C^p{£) between photo-z bins a and (3 at multipole £. Let us examine how these will be biased by photo-z outliers. 
[The data covariance matrix C of §11 B is the matrix Gov of Eq. (4).] 

We assume a survey with the (true) distribution of source galaxies in redshift ng{z), divided into some number iVb 
of bins in redshift. Let us define the following terms 

• Leakage: fraction of objects from a given spectroscopic bin that are placed into an incorrect (non-corresponding) 
photometric bin. 

• Contamination: fraction of galaxies in a given photometric bin that come from a non-corresponding spectroscopic 
bin. 

One could estimate either of these quantities — after all. when specified for each bin, they contain the same 
information. Let leakage fraction 1st of galaxies in some spectroscopic-redshift bin ns (the "source" of leakage) end 
up in some photo-z bin nr (the "target" of leakage), so that 1st is the fractional perturbation in the source bin. Note 
that, since bins S and T may not have the same number of galaxies, the fractional perturbation in the target bin is 
not the same. The contamination of the target bin T, cst, is related to the source bin leakage via 

Ns 

CST = irr- 1st (10) 

i\t 

where Ns and Nt are the absolute galaxy numbers in the source and target bin respectively. 
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FIG. 1: Explanation of how the leakage and contamination operate. In this figure, we assume for simplicity that the number 
of galaxies in the source and target bin is the same, so that 1st = cst- Because the redshift distribution n{z) is normalized to 
unit integral in each bin, the source bin's redshift distribution ns{z) does not change; see the bottom left panel. The target 
bin's redshift distribution, nriz), does change however, as illustrated in the bottom right panel. 

The redshift distribution of galaxies (normalized to unity) in the source bin, ns, does not change, since a fraction 
of galaxies is lost — but the redshift distribution is normalized to unit integral; see Fig. 1. Conversely, things are 
perturbed in the target bin, since it now contains two populations of galaxies, the original one with fraction 1 — cst, 
and the contamination at incorrect (source bin) redshift with fraction cst', again this is clearly shown in Fig. 1. 
Therefore 



ns — > ns 

nT ^ {1 ~~ Cst) it-t + cst ns 



(11) 
(12) 



and only the target bin is affected (i.e. biased) by photo-z catastrophic errors. 

The effect on the cross power spectra is now easy to write dowir. Clearly, only the (a,/3) cross-spectra where one 
of the bins is the target bin — a = T or (3 = T — will be affected 



Ctt ^ — cst^Ctt + '2cst{^ ~ cst)Cst + c^j^Css 

CaT ^ (1 - CsT)CaT + CsT CaS (" ^ T) 

Ca0 — > Ca(3 (otherwise) 



(13) 



We have checked that ignoring the quadratic terms in Ca/j leads to no observable effects to the results (for cst = 0.001 
contamination). The biases can now be computed as the right hand side minus the left hand side in the formulae 
above. We replace the single index a for data elements in Eq. (5) with the triplet iaf3 so that D^ap = and we 

reserve the symbol C for the covariance of the data elements. The bias in data induced by the catastrophic errors is 



(14) 



If we make the further assumption that the convergence is a Gaussian random field, then we have 



w 
2" 



(15) 
(16) 
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Eq. (5) simplifies considerably when we invoke Eqs. (14) and (16): 

Ap, = ^(F-i),,A//j,^,c^„ (17) 

Mj,^^ ^ E - (^')-] ' (18) 

As a reminder, the Fisher matrix in the case of a zero-mean Gaussian variable is [54] 

In a cosmological application we will marginalize over all parameters except a subset of interest A. In the Fisher 
approximation bias is simply projected onto the A subset: /S.pA = PaAj), where Pa is the projection matrix (see 
Appendix B). If Fa is the marginalized Fisher matrix, then the Ay^ of the bias after marginalization is 

= E E ^A^-V-' E ^^^M- {F~^pIFaPaF-^)^^ Af,,^v'. (21) 

All/ ij 

III. APPLICATION TO CANONICAL SURVEYS 

For first study we examine a weak lensing survey similar to that proposed for SNAP [55] , but with the source-galaxy 
selection restricted to incur minimal catastrophic error rate. Evaluation of other potential surveys could be performed 
following the same model. 

We take the eight-parameter cosmological model considered by the Dark Energy Task Force (DETF; [56]): dark 
energy physical density f2DE^^, and equation of state parameters and Wa] normalization of the primordial power 
spectrum A and spectral index n; and matter, baryon, and curvature physical densities JIm^^, ^sh^ , and flkh^. The 
fiducial values of these parameters are taken from the 5-year WMAP data [57] . We will assume a Planck CMB prior as 
specified by the DETF report. Recall that application of further priors can only weaken the requirements on photo-z 
outliers (see Appendix A). 

We assume shear tomography with 20 bins linearly spaced over < 2 < 4 with Az = 0.2; we have checked that 
the results are stable with Az. The redshift distribution and fiducial Cap are taken from a simulation of the photo-z 
performance of SNAP as described in [58]. The procedure is to (1) create a simulated catalog of galaxies; (2) calculate 
their noise-free apparent magnitudes in the SNAP photometric bands spanning the visible and NIR to 1.6 fim; (3) 
add the anticipated observational noise to each magnitude; (4) determine a best-fit galaxy type and redshift with the 
template-fitting program Le Phare^; (5) examine the 95% confidence region zi < Zp < Zh determined by Le Phare and 
retain only those galaxies satisfying [59] 

D95 = In (^Yrt) - ^^^^ 

This strict cut results in a catalog of w 70 galaxies per arcmin^, with a median redshift of z.^ w 1.2. The WL survey 
is assumed to cover /sky = 0.1 of the full sky, with shape noise of — 0.24 per galaxy. We consider only shear 
tomography at the 2-point level, as this will likely maximize the bias imparted by redshift errors. We also ignore 
systematic errors other than redshift outliers, which will likely maximize the statistical significance of the outlier bias. 
We will henceforth in this paper define a redshift outlier to satisfy 



1 + Zs 



> 0.2 (catastrophic outlier definition). (23) 



In the simulated photo-z catalog, 2.2% of the source galaxies are outliers by this criterion. In our analyses below we 
will only consider biases from photo-z errors meeting this outlier criterion, i.e. we assume the "core" of the photo-z 
distribution is well determined. 



1 www.oamp.fr/pcoplc/arnouts/LE PHARE.html 
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FIG. 2: Left: The contamination rate Csp/Azs of the photo-z bin per unit redshift in spectro-z is plotted for our example case. 
Note that the contamination is highest at Zp > 2.5 and zp < 0.2, where there are relatively few source galaxies and hence a 
small number of outliers can become a large fractional contamination. Right: The quantity hsp which specifies the significance 
of the Wo — Wa bias caused by a contamination rate of 0.001 Szp across a range Szp of photo-z. This plot indicates that the 
outlier contamination rates must be known to 1-3 parts per thousand over all photometric redshift bins, most sensitively at 
0.3 <zp< 1.5. 



Figure 2 illustrates the canonical model and the sensitivity to redshift outliers in this model. The left-hand panel 
shows the quantity c^p/Az vs and Zp. [We scale the contamination by Az to produce a quantity that is independent 
of the choice of bin size Az.] The highest contaminations are in two "islands": one at Zp > 2.5, Zg < 0.6 is probably 
due to confusion between high-z Lyman breaks and low-z 400-nm breaks. Because true z > 2.5 galaxies arc relatively 
rare, a small leakage rate from Zs ^ 0.5 can produce a high contamination fraction. The Le Phare code run for this 
simulation does not incorporate a magnitude prior for the photo-z; doing so might reduce the size of this island. 

A second high-contamination region is Zs « 1, Zp < 0.2. Again the contamination rate is high because the target-bin 
density is much lower than the source-bin density. 

The right-hand panel in Figure 2 shows Ax^ evaluated using Eq. (21) for this case of catastrophic errors. We 
calculate the significance Ax2d of the bias after marginalization of the cosmology onto the wq — Wa plane. We simplify 
by considering the bias arising from contamination in a single bin. This is 

AxL = {KuAzfcl^, (24) 
h% ^ (P^'PaPaPaF-^)^^ M,,^,/{Az)'. (25) 

Again the inclusion of the Az factor defines a h^^ that is invariant under rebinning. The interpretation of /i^i/ is as 
follows: if there is an "island" of outliers that spans a range Szp of photo-z bins, and contains a fraction c of the 
galaxies in these photo-z bins, then the 2d significance of the resultant bias will be 

^/A^ « hf^jzpc. (26) 

Figure 2 has been scaled by 1000, so that it indicates the bias significance of a contamination rate c ~ 0. 001/(5 Zp. We 
desire Ax2d ^ 2.3 to keep the bias well within the 68% confidence contour. The most severe constraint on c would 
be to take the peak value hp^i, ~ 1300 and a very wide island, Szp w 0.5, in which case the criterion for small bias 
becomes 

c < l/{h^Jzp) « 1/(1300 * 0.5) = 0.0015. (27) 

The contamination rate into any island of outliers must be known to 0.0015 or better to avoid significant cosmological 
bias. This conclusion is independent of the nominal outlier rate. The tolerance on outlier rate will scale with sky 
coverage as f'^l^^- 
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IV. CONSTRAINT VIA SPECTROSCOPIC SAMPLING 



The most obvious way to determine the contamination rate Cap is to conduct a complete spectroscopic redshift 
survey of galaxies in photo-z bin (3. It is of course essential that the spectra be of sufficient quality to determine 
redshifts even for the outliers in the sample. 

Let us now estimate the total number of spectra Ngp^c required in order to keep the total bias below some desired 
threshold. We will assume that each redshift drawn from the spectroscopic survey is statistically independent. In this 
case the distribution of Nafs, the number of galaxies in photo-z bin /3 that have spectro-z in bin a, will be described by 
a multinomial distribution. When the outlier rates are small, the number of spectra in each outlier bin tend toward 
independent Poisson distributions. 

We would like to relate the contamination uncertainties 6cij to the required number of galaxy spectra. Let iV^ be 
the number of spectra drawn from the photometric redshift bin (3 so that A'spcc = ^P- ^^^^^ i-^^afi) — Ca/iNis 
and the variance of the contamination estimate is 



Since the Poisson errors between different outlier bins are uncorrelated, the expected bias significance becomes 



(Ax") = Y,{h^pAzf{6clp) = Y,{KpAzfc^p/Np. (29) 

all all 

We would like to quote a total number of spectroscopic redshifts rather than the number per photo-z slice {Np above) in 
order to make our findings more transparent. We consider two cases: first, a slitless or untargeted spectroscopic survey 
will obtain redshifts in proportion to the number density of source galaxies in each redshift bin: Np = Nspccnp/n. 
Then we will consider a targeted survey, in which the number Np can be chosen bin-by-bin to produce the minimal 
bias for given total iVspoc. 



A. Untargeted spectroscopic survey 

In the untargeted case, Np — Nspccnp/n and the condition Ax^ <S 1 becomes (from Eq. (29)) 

A^spcc > hlfjCapn/np. (30) 

af3 

Figure 3 plots the summand of this expression in the Zg — Zp plane. The required A'spcc is hence the sum over values 
in this plane (note that we omit the bins near the diagonal that do not meet the "outlier" definition). We find that 
Ax2d ^ 1 requires A'spcc ^ 8 x 10^, and in the full 8-D parameter space (that is, considering Ax^ for the 8-dimensional 
parameter ellipsoid), we need A^spec ^ 2 x 10^. 

These requirements are daunting, potentially larger than the A^spec that are needed to constrain the core of the 
photo-z distribution as determined by Ma and Bernstein [14]. Note however that the requirement is strongly driven 
by the region Zs > 2.5, Zp < 0.8. This is because contamination rates are high here in the nominal photo-z distribution 
(Figure 2), and these bins are sparsely populated (small rip), meaning that many spectra must be taken in order to 
accumulate a strong enough constraint on these contamination coefficients. 

This suggests a strategy of omitting the Zp > 2.5 galaxies from the tomography analysis entirely. Omitting Zp > 2.5 
from the sum (30) produces a much relaxed requirement: for ^X2d ~ need A'spcc ^ 2.8 x 10"', a 30-fold reduction. 
This strategy would eliminate w 8% of the source galaxies in the SNAP model, and reduce the dark-energy constraint 
power by 18%, as mesured by the DETF figure of merit. This would be an acceptable strategy to reduce the outlier 
bias if one were unable to eliminate the high-Zj, island of outliers by refinements to the photo-z methodology. 
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FIG. 3: Number of calibration spectra required to attain an outlier bias significance of Axid = 1 is given by the integral of this 
plot over the Zs — Zp space. The quantity plotted is, from Eq. (30), h^pCepn/np, where Up is the source density in the photo-z 
bin and n is the total source density. The plot is in units of 10^ galaxies. Note that the required number of calibration spectra 
is strongly driven by the need to constrain the outliers with photo-z Zp > 2.5 but true redshifts Zs < 0.8. 



B. Targeted spectroscopic survey 

If we wish to minimize Ax^ in Eq. (29) for a given total A^spoc, a simple optimization yields 



(31) 



(32) 



Optimized targeting reduces the requirement for Axi,^ ^ 1 to be iVspoc ^ 1-2 x 10^. Eliminating the Zp > 2.5 sources 
reduces the requirement sixfold, iVspcc ^ 2.0 x 10"'. 

Note that the targeted redshift requires 7 times fewer calibration redshifts than the untargeted survey, if we are 
using the full source redshift range, but only 1.4 times smaller A'gpcc if we restrict Zp < 2.5 in the lensing analysis. 



C. Scaling and Robustness 



The required -/Vgpec to reduce outlier-rate biases to insignificance scales with the sky coverage and the mean outlier 
rate as 

A'^spoc tx /skyC (33) 

when most of the information is coming from shear tomography, and the depth of the survey is held fixed. We have 
verified that -/Vgpec varies little as the number of tomography bins grows (Az < 0.2) and the information content of the 
tomography saturates. The two bias theorems imply that the required Ngpcc will drop if we add additional unbiased 
prior information, or if we marginalize down to a single dark-energy parameter. 

More precisely, we find that the ratio Ngpcc x (o'(wp) x (^{wa)) = A'spoc/FoM (featuring the well known "figure of 
merit" [53, 56]) is roughly the same with several alternative survey specifications we consider (and is exactly the same 
if only the sky coverage /sky is varied). 
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We have used two independent codes to verify the robustness of results to the myriad of assumptions made and 
check for the presence of unwanted numerical artifacts. The two codes agree to roughly a factor of two in Ngp^c, which 
is satisfactory given the differences between the implementations, e.g. whether curvature and/or neutrino masses 
are free to vary, and whether the fiducial redshift distribution is smoothed over the cosmic variance in the simulated 
catalog. 

D. Correlated outlier errors and incompleteness 

So far we have considered the case where bin-to-bin fluctuations in contamination Scafs are uncorrelated. 
Contamination-rate errors Sca/3 that are correlated from bin to bin might arise if the spectroscopic survey systemati- 
cally misses outliers in certain redshifts islands (or if the spectroscopy is not done at all!). We can set a specification 
on the maximum allowable systematic contamination rate error Sc in an island of photo-z width dzp using Eq. (26). 
Our previous results for the canonical survey show that the contamination rate in the island should be known to 
6c ^ 0.0015. In other words a spectroscopic-rcdshift failure rate of only 0.15% in some range of Zp can cause a 
significant cosmology bias if all of these missed redshift arc outliers in a particular island. A 99.9% success rate has 
rarely if ever been achieved in a spectroscopic redshift survey. 

V. CONSTRAINING OUTLIER RATES USING GALAXY CORRELATIONS 

The above requirements on A'spoc and completeness may be too expensive to accomplish, particularly for fainter 
galaxies. We now examine the possibility that one could make use of a spectroscopic galaxy sample that does not 
fairly sample the photo-z galaxies [51]. The idea is to cross-correlate a photo-z sample at nominal redshift Zp with a 
spectroscopic sample known to be confined to a distinct bin Zg. The amplitude of this cross-correlation will tell us 
something about the contamination rate c^p, since there is no intrinsic correlation between the galaxy densities at the 
two disparate redshifts. 

Newman [51] calculates the errors in this estimate that would be induced by shot noise in the sample (for a somewhat 
related work, see [60]). Here we assume that statistical errors will be negligible and attend to two systematic errors 
that will arise. 



A. Outlier bias and correlation coefficients 

First, we define gs, gp to be the fiuctuations in sky density of the projected distributions of the spectroscopic-survey 
galaxies in bin s and the photometric-redshift galaxies in bin p. We set to be the fractional fluctuations in the 
projected mass density in redshift bin s. The bias is defined by (9^) = h1{m1). The amplitude of the measured 
cross-correlation between the angular distributions gs and gp of the spectroscopic and photometric samples can be 
written as 

{9s9p) = CspbsbspTspiml) = Csp ^"^^"^ (gf), (34) 

where bgp is the bias of the outlier population, and 7'^^ is the correlation coefficient between the density fluctuations 
of the spectroscopic and the outlier populations. The outlier population arc those minority of sources in photo-z bin 
p that have spectroscopic redshifts in bin s. 

In a large survey, shot noise in (gsgp) and in (g^) might become small, and 6s could be determined to good accuracy 
through study of the redshift-space power spectrum of the spectroscopic targets. It will be difficult, however, to discern 
bsp because the angular correlation signal of the outlier population is overwhelmed by the correlations of the galaxies 
in the "core" of the photo-z error distribution (i.e. photo-zs which have not been catastrophically misestimated). 
The correlation coefficient rsp also has no alternative observable signature we have identified. Hence there will be a 
systematic-error floor on Scsp arising from the finite a priori knowledge of the product bgprsp. 

B. Lensing Magnification 

The second complication to the cross-correlation method is that gravitational lensing magnification bias will induce 
a correlation between the spectroscopic and photometric samples even if there is no contamination. Let us assume 
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that the spectroscopic sample is in the foreground of the photo-z "core" ; a similar analysis can be done when using 
cross-correlation to search for contamination by background galaxies. There are two types of magnification-induced 
correlations. Following the notation of [50], there is a "GG" correlation, in which both the spectro and photo galaxy 
samples are lensed by mass fluctuations in the foreground of both. Then there is a "GI" effect, in which the mass 
associated with the fluctuations in the foreground sample induces magnification bias on the background sample. 

The GI correlation is as follows: Let the spectroscopic bin Zs span a range Ax^ in comoving radial distance. The 
matter fluctuations m,, induce a lensing convergence on the photo-z bin at Zp of 

Kp = ——Axs rus, (35) 

^ Xp 



where tUm = Q.mh'^ = 0.127 is the comoving matter density, Xp Xs are the comoving angular diameter distances 
to Zs and Zp, and we have assumed a flat Universe. 

The lensing magnification will induce apparent density fluctuations in the background sample as 

g;,"- = qpKp, (36) 

where ?p is a magniflcation bias factor for the galaxies in the photo-z bin. For instance if the selection criteria for the 
bin were a simple flux limit, and the intrinsic flux distribution were a power law dn/df oc then we would have 
(7p = 2s — 2. In general qp will be of order unity. 

The foreground galaxy distribution has a correlation coefficient with the mass m^, hence a covariance between 
populations results: 

{9s9p)gi = ^^^Xs— — —qpbsrsiml) (37) 
^ Xp 

= -^^Xs-^^ qp-r\9s) (38) 

^ Xp 

(we have ignored shot noise in the galaxy auto-correlation). This lensing contamination will have to be subtracted 
from {gsSp) in order to extract the information on contamination Csp. Even if all the cosmological factors are well 
determined, the magniflcation coefficient qp will have to be empirically estimated. Finite accuracy in this estimate 
will increase 5csp- 

The GG correlation scales as follows: let k,s be the convergence induced on the foreground (spectroscopic) sample 
by mass at z < Zg- This produces density fluctuations gs = qs^s- This mass induces convergence Hp w f^si^iXsTXp) 
on the background (photo-z) source population, where r is an integral involving the distributions of foreground mass 
which must satisfy r > 1. Not concerning ourselves with details, we take r = 1. The induced angular correlation will 
be 

{9s9p)gg = qsqpr{nl)- (39) 

Typical RMS values Kg are 0.01-0.02 at cosmological distances. The GG lensing correlation must be removed from 
the signal to retrieve the contamination fraction, and again even if there is no shot noise and all distances and lensing 
amplitudes are known perfectly, the values of qs and qp will only be known to flnite precision. 

C. Estimate of systematic errors 

Summing the GG, GI, and intrinsic contributions, the cross-correlation between spectroscopic and photometric 
samples is 

{9s9p) = (^Ax.^^^^gp? + cj-^] {gl) + q^qprinl) (40) 



2 Xp 'bs °" b 

1 r6s(5s5p) bsqsqp{nl) 3w™ Xp - Xs 

^Xs Tsqp 



bspi^sp 



{9's) (g's) 2 Xp 



(41) 



All of the right-hand quantities are potentially well measured from the survey data itself or from other cosmological 
probes, except the outlier covariance factor bspVsp and the magnification coefficients qp and qp. Uncertainties in the 
a priori assumed values of these factors will propagate into the contamination coefficient as 

{Scspf « [5{bsprsp)f c% + 5ql (^^Ax.^^)' + {6ql + <5<z?) (i^)' (42) 
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Here we assume 5 w r w 1, w gp. 

Earlier we showed that contamination into an outher "island" should be known to Sc^p < 0.0015 to avoid significant 
parameter bias. Can such a small contamination be measured using the cross-correlation technique? 

• If the nominal outlier rate is Csp ~ lOScgp ~ 0.015, then we require a prior estimate of outlier bias/covariance 
accurate to <5(&sp^sp) < 0.1. Little will be known about the outlier population besides its luminosity range, and 
the outliers may tend to be active galaxies or those with unusual spectra whose clustering properties might 
be deviant as well. We would consider a 10% prior knowledge on outlier bias to be optimistic but perhaps 
attainable. 

• For the second (GI) term, if we take the distance factors to be w 1, and the outlier population to span a range 
Axs ~ 0.3, then the magnification bias coefficient must be known to an accuracy of Sqp ^ 0.025. This accuracy 
in Qp will be challenging to achieve. If the galaxy selection is by simple magnitude cut, then the slope of the 
counts yields qp and potentially could be measured to high precision. Weak lensing samples typically have more 
complex cuts and weightings, however, than a simple flux cutoff. Surface brightness, photo-z accuracy, and 
ellipticity errors arc involved, making estimation of qp more difficult. 

• The third (GG) term places constraints on 6qp and Sqs that will generally be weaker than those from the GI 
term. 

If the cross-correlation technique is to determine outlier contamination fractions to an accuracy that renders them 
harmless, then we will need to know the hspTsp product of the outlier population to 0.1 or better, and also must know 
the magnification-bias coefficients q of our populations to 2% accuracy. This is true regardless of sample size, and 

— 1/2 

these tolerances will scale as f^^^ . The demands on S{bsprgp) also becomes more stringent linearly with the photo-z 
outlier rate. 

We have not considered the possibility of extraneous angular correlations induced by dust correlated with the 
foreground galaxy sample, or by dust in front of both samples (B. Menard, private communication). This signal will 
be present to some degree, though may perhaps be diagnosed with color information. 

In summary, while we have shown here that the cross-correlation technique proposed by [51] is sensitive to catas- 
trophic redshift errors, we found that prospects of measuring these errors (that is, the contamination coefficients c^p) 
will be difficult using this technique alone. 

VI. DISCUSSION AND CONCLUSION 

In this paper we have considered the effects of the previously ignored catastrophic redshift errors — cases when the 
photometric redshift is grossly misestimated, i.e. when \zp — Zs\ ~ 0(1), and are represented by arbitrary "islands" 
in the Zp — Zg plane. We developed a formalism, captured by Eqs. (13), that treats these islands as small "leakages" 
(or "contaminations" ) and directly estimates their effect on bias in cosmological parameters. We then inverted the 
problem by estimating how many spectroscopic redshifts are required to control catastrophic errors at a level that 
makes them harmless for cosmology. In the process, we have proven two general-purpose theorems (in the Appendices): 
that the bias due to systematics always decreases or stays fixed if 1) (unbiased) prior information is added to the 
fiducial survey, or 2) we marginalize over one or more dimensions of the parameter space. 

We found that, at face value, of order million redshifts are required in order not to bias the dark energy parameter 
measurements (that is, in order to lead to A^^ ^ 1 in the wq — Wa plane). However, the requirement becomes 
significantly (30 times) less stringent if we restrict the survey to redshift z < 2.5; in that case, A^spoc is only of order 
a few tens of thousands. Essentially, leakage of galaxies from lower redshift to z > 2.5 is damaging since there are 
few galaxies at such high redshift and the relative bias in galaxy number is large. Therefore, using only galaxies with 
z ^ 2.5 helps dramatically by lowering the required iVspcc while degrading the dark energy (figure-of-merit) constraints 
by mere ~ 20%. 

We have studied two approaches for a spectroscopic survey: the untargeted one where the number of spectra at 
each redshift is proportional to the number of photometric galaxies (§IV A) and the targeted one where the number 
of spectra is optimized to be minimal for a given degradation in cosmological parameters (§IVB). For the case where 
galaxies with Zp > 2.5 are dropped, the targeted survey gave only a modestly 40%) smaller required A^spoc- 

We do not imply that these Nspcc requirements to apply to all proposed surveys to high accuracy, although the 
O(10~'^) required knowledge on catastrophe rates is robust. The calculation should be repeated with the fiducial 
photo-z outlier distribution, survey characteristics, and cosmological parameters of interest to a particular experiment. 

Our work demonstrates for example that efforts to reduce the "island" of catastrophic mis-assignment from z w to 
z ft! 3, such as magnitude priors, could greatly reduce the required Ngpec- Since A'spoc oc /skyC (with c being the mean 
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rate of catastrophic contamination), it is clear that a photo-z survey with improved S/N and wavelength coverage to 
reduce the total catastrophe rate will also require lower Nspec to calibrate these rates. 

Another practical implication of these results is that the spectroscopic redshift surveys must be of very high 
completeness — 99.9% if there is a possibility that all failures could be in an outlier island, but less if some fraction of 
the failures are known to be in the core of the error distribution. 

If an outlier island is known to exist at a particular (zs, Zp) location, it may be possible to include the contamination 
Csp as a free parameter in the data analysis and marginalize over its value. It is possible that self-calibration may 
reduce the bias in cosmological parameters. It is likely infeasible, however, to leave the Csp values over the full (zg, Zp) 
plane as free parameters. We leave self-calibration of outlier rates for future work. 

We have also studied whether the technique proposed by Newman [51], which correlates a photometric sample 
with a spectroscopic one, can be used to measure, and thus correct for, catastrophic redshift errors. The advantage 
of this approach is that the spectroscopic survey need not be a representative sampling of the photometric catalog. 
While we found that the cross-correlation technique is sensitive to catastrophic errors (specifically, the contamination 
coefficients Csp), the contamination coefficient is degenerate with the value bsprgp of the bias and stochasticity of 
the outlier population. Furthermore there is a correlation induced by lensing magnification bias that spoofs the 
contamination signal. It will therefore be difficult to use the cross-correlation technique to constrain outlier rates to 
the requisite accuracy. 

Overall, we are very optimistic that the catastrophic redshift errors can be controlled to the desired accuracy. We 
have identified a simple strategy that requires only of order 30,000 spectra out to z ~ 2.5 for the calibration to be 
successful for a SNAP-type survey. Incidentally, this number of spectra required for the catastrophic errors is of the 
same order of magnitude as that required for the non-catastrophic, "core" errors [11, 12, 14]. Total spectroscopic 
requirements of a survey will be based on the greater of requirements of these two error regimes. 
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APPENDIX A: EFFECT OF UNBIASED PRIORS ON BIAS SIGNIFICANCE 

Will a bias get worse or better (more or less significant) when additional unbiased prior information is added to the 
likelihood? It is intuitive that biases AP should decrease when unbiased information is added. However F F+G for 
some new non-negative-definite prior Fisher matrix G, meaning that the statistical errors also shrink. So which effect 
wins out? We prove here that addition of unbiased prior information cannot increase the significance of parameter 
bias. 

The proof is straightforward: Eq. (9) gives the significance Ax^ of a bias in terms of the original positive-definite 
Fisher matrix F and the vector V. If G is a non-negative-definitc prior, then the change significance of the bias is 

Ax^(with prior) - Ax^(without prior) = V'^{F + Cy^V - V'^ F~^V (Al) 

This quantity cannot be positive. If it were, then there would some < Aq < 1 and a positive-definite matrix 
H = F + XqG such that 

< ^ [V^{F + XG)~^V]^^^ = -{H-^VfGH-^V. (A2) 

If G is non-negative definite, this situation cannot occur. We hence conclude that the A^^ of some bias is always 
reduced (or stays the same) by addition of an unbiased prior. 

APPENDIX B: EFFECT OF MARGINALIZATION ON BIAS SIGNIFICANCE 

Second we can ask: If we calculate the significance of a bias induced over a parameter space, then marginalize 
away parameter vector B to leave parameter vector A, how might the significance differ in the smaller space? We 
show that in the Gaussian limit, marginalization always reduces (or leaves unchanged) the Ax^ assigned to the bias, 
although the A^^ per DOF may increase. To see this: first we note that marginalization over B docs not change the 
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biases in the parameters A if the distribution is Gaussian. So the bias in A is simply a projection matrix Pa times 
AP: AP4 = PjiF~^V. The Ax^ after marginaUzation down to the A space is determined by the marginahzed Fisher 
matrix, F' = [{F~^)aa]~^ ■ So wc have 



The equivalence in (Bl) can be derived from manipulation of the common expression for the inverse of a matrix 
decomposed into an 2 x 2 array of submatrices. Because Feb and its inverse must be non-ncgativc-dcfinitc, the last 
term is negative, so we are assured that {Ax^)a < ^X^- Equality is, however, easily obtained, for example if there is 
no bias in the B parameters. We thus know that Ax^/iVooF can potentially increase. 
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